Supporting Case-Based Retrieval by Similarity Skylines: Basic Concepts and Extensions
نویسندگان
چکیده
Conventional approaches to similarity search and case-based retrieval, such as nearest neighbor search, require the specification of a global similarity measure which is typically expressed as an aggregation of local measures pertaining to different aspects of a case. Since the proper aggregation of local measures is often quite difficult, we propose a novel concept called similarity skyline. Roughly speaking, the similarity skyline of a case base is defined by the subset of cases that are most similar to a given query in a Pareto sense. Thus, the idea is to proceed from a d-dimensional comparison between cases in terms of d (local) distance measures and to identify those cases that are maximally similar in the sense of the Pareto dominance relation [2]. To refine the retrieval result, we propose a method for computing maximally diverse subsets of a similarity skyline. Moreover, we propose a generalization of similarity skylines which is able to deal with uncertain data described in terms of interval or fuzzy attribute values. The method is applied to similarity search over uncertain archaeological data.
منابع مشابه
Supporting Case-Based Retrieval by Similarity Skyline
Conventional approaches to similarity search and case-based retrieval, such as nearest neighbor search, do require the specification of a global similarity measure which is typically expressed as an aggregation of local measures pertaining to different aspects of a case. Since the proper aggregation of local measures is often quite difficult, we propose a novel concept called similarity skyline...
متن کاملUESTC at ImageCLEF 2012 Medical Tasks
This paper describes the methods used and results archived by our research group in the ImageCLEF 2012 medical retrieval and classification tasks. We performed three sub-tasks, ad-hoc retrieval, case-based retrieval, and modality classification. For the retrieval tasks, we combined semantic-based retrieval with traditional text-based retrieval. The semantic-based retrieval was conducted by comp...
متن کاملCase Retrieval Nets: Basic Ideas and Extensions
An eecient retrieval of a relatively small number of relevant cases from a huge case base is a crucial subtask of Case-Based Reasoning (CBR). In this article, we present Case Retrieval Nets, a memory model that has recently been developed for this task. The main idea is to apply a spreading activation process to the case memory structured as a Case Retrieval Net in order to retrieve cases being...
متن کاملKIDS Lab at ImageCLEF 2012 Personal Photo Retrieval
The personal photo retrieval task at ImageCLEF 2012 is a pilot task for testing QBE-based retrieval scenarios in the scope of personal information retrieval. This pilot task is organized as two subtasks: the visual concepts retrieval and the events retrieval. In this paper, we develop a framework of combining different visual features, EXIF data and similarity measures based on two clustering m...
متن کاملOptimizing Nearest Neighbor Retrieval by Similarity Template and Retrieval Query Generation
The nearest neighbor algorithm is the most basic class of techniques in the subelds of machine learning such as case-based reasoning (CBR), memory-based reasoning (MBR), and instance-based learning (IBL). In the nearest neighbor algorithm, the computational cost of example retrieval is one of the most important issues. This paper proposes a novel technique for optimizing the nearest neighbor al...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008